Unsupervised learning of Arabic non-concatenative morphology
Similar resources
Learning non-concatenative morphology
Recent work in computational psycholinguistics shows that morpheme lexica can be acquired in an unsupervised manner from a corpus of words by selecting the lexicon that best balances productivity and reuse (e.g. Goldwater et al. (2009) and others). In this paper, we extend such work to the problem of acquiring non-concatenative morphology, proposing a simple model of morphology that can handle ...
Induction of Root and Pattern Lexicon for Unsupervised Morphological Analysis of Arabic
We propose an unsupervised approach to learning non-concatenative morphology, which we apply to induce a lexicon of Arabic roots and pattern templates. The approach is based on the idea that roots and patterns may be revealed through mutually recursive scoring based on hypothesized pattern and root frequencies. After a further iterative refinement stage, morphological analysis with the induced ...
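The idea of mutually recursive root and pattern scoring can be illustrated with a small sketch. It is not the method of the paper above, whose details are truncated here. It assumes a hypothetical scheme in which every choice of three letters in a word is a candidate root, the remaining letters form the pattern (root slots written as `_`), and root and pattern scores reinforce each other through repeated normalized summation. The function names, the transliterated toy words, and the scoring rule are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import combinations


def candidate_analyses(word, root_len=3):
    """Yield every (root, pattern) split that picks root_len letters as the root."""
    for positions in combinations(range(len(word)), root_len):
        root = "".join(word[i] for i in positions)
        # The pattern keeps the non-root letters and marks root slots with "_".
        pattern = "".join("_" if i in positions else ch for i, ch in enumerate(word))
        yield root, pattern


def mutual_scores(words, iterations=10):
    """Score roots by the patterns they occur with, and patterns by their roots."""
    pairs = {pair for w in words for pair in candidate_analyses(w)}
    root_score = defaultdict(lambda: 1.0)  # uniform start (an assumption)
    pattern_score = {}
    for _ in range(iterations):
        # A pattern is as good as the roots hypothesized to fill it.
        new_pattern = defaultdict(float)
        for root, pattern in pairs:
            new_pattern[pattern] += root_score[root]
        total = sum(new_pattern.values())
        pattern_score = {p: s / total for p, s in new_pattern.items()}
        # A root is as good as the patterns it is hypothesized to occur in.
        new_root = defaultdict(float)
        for root, pattern in pairs:
            new_root[root] += pattern_score[pattern]
        total = sum(new_root.values())
        root_score = {r: s / total for r, s in new_root.items()}
    return root_score, pattern_score


def analyze(word, root_score, pattern_score):
    """Return the candidate (root, pattern) with the highest combined score."""
    return max(
        candidate_analyses(word),
        key=lambda rp: root_score.get(rp[0], 0.0) * pattern_score.get(rp[1], 0.0),
    )


if __name__ == "__main__":
    # Toy transliterated word list (hypothetical data, not from the paper).
    words = ["katab", "kaatib", "maktuub", "daras", "daaris", "madruus"]
    roots, patterns = mutual_scores(words)
    for w in words:
        print(w, analyze(w, roots, patterns))
```

In this toy setting, splits whose pattern is shared by several roots reinforce each other across iterations, while splits that occur with only one word do not. A realistic system would need a far larger corpus and further constraints on what counts as a root letter.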
Semi-Supervised Learning of Concatenative Morphology
We consider morphology learning in a semi-supervised setting, where a small set of linguistic gold standard analyses is available. We extend Morfessor Baseline, which is a method for unsupervised morphological segmentation, to this task. We show that known linguistic segmentations can be exploited by adding them into the data likelihood function and optimizing separate weights for unlabeled and...
A Probabilistic Model for Learning Concatenative Morphology
This paper describes a system for the unsupervised learning of morphological suffixes and stems from word lists. The system is composed of a generative probability model and hill-climbing and directed search algorithms. By extracting and examining morphologically rich subsets of an input lexicon, the directed search identifies highly productive paradigms. The hill-climbing algorithm then furthe...
Agreement and Plural features in Heritage Arabic Speakers
Studies of heritage speakers of Spanish and Russian have reported that verbal and nominal morphology are vulnerable areas for L1 loss and incomplete acquisition. In this paper, we will report on ongoing experimental research on the vulnerability of nominal and verbal agreement in Heritage Arabic speakers, since Arabic presents comple...